Rebalancing Docked Bicycle Sharing System with Approximate Dynamic Programming and Reinforcement Learning
Abstract
The bicycle, an active transportation mode, has received increasing attention as an alternative in urban environments worldwide. However, effectively managing the stock levels of rental bicycles at each station is challenging because demand varies with time, particularly when users are allowed to return bicycles at any station. There is a need for system-wide management of bicycle stock levels by transporting available bicycles from one station to another. In this study, a bicycle rebalancing model based on a Markov decision process (MDP) is developed using a real-time dynamic programming method and reinforcement learning, considering dynamic system characteristics. Pickup and return demands are stochastic and continuously changing. As a result, the proposed framework suggests the best operation option every 10 min, based on the realized system variables and the future demands predicted by the random forest method, minimizing the expected unmet demand. Moreover, we adopt custom prioritizing strategies to reduce the number of action candidates for the operator and the computational complexity, for the practicality of the MDP framework. Numerical experiments demonstrate that the proposed model outperforms existing methods, such as short-term and static lookahead policies. Among the suggested prioritizing strategies, focusing on stations with a larger error in demand prediction was found to be the most effective. Additionally, the effects of various safety buffers were examined.
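The decision step described in the abstract (restrict the action candidates to prioritized stations, then pick the operation with the lowest predicted unmet demand) can be illustrated with a small sketch. This is not the authors' MDP model: it is a myopic, one-interval greedy simplification that uses point forecasts in place of an expectation, and every name in it (`unmet_demand`, `prioritize`, `best_action`, `truck_load`) is hypothetical and chosen only for illustration.

```python
from itertools import permutations


def unmet_demand(stock, capacity, pickups, returns):
    """Unmet pickups plus unmet returns at one station over one interval.

    Assumption: pickups are served first, then returns fill the free docks.
    """
    served = min(stock, pickups)
    unmet_pickups = pickups - served
    free_docks = capacity - (stock - served)
    unmet_returns = max(0, returns - free_docks)
    return unmet_pickups + unmet_returns


def prioritize(errors, k):
    """Indices of the k stations with the largest absolute forecast error."""
    order = sorted(range(len(errors)), key=lambda i: abs(errors[i]), reverse=True)
    return order[:k]


def best_action(stocks, capacities, pickups, returns, errors, k=2, truck_load=3):
    """Return (predicted unmet demand, action) for the best single move.

    An action is (source, destination, bikes_moved); None means "do nothing".
    Only moves between the k prioritized stations are considered.
    """
    def total(levels):
        return sum(
            unmet_demand(levels[i], capacities[i], pickups[i], returns[i])
            for i in range(len(levels))
        )

    best = (total(stocks), None)  # baseline: no rebalancing
    for src, dst in permutations(prioritize(errors, k), 2):
        for n in range(1, truck_load + 1):
            # Stop once the source runs out of bikes or the destination is full.
            if n > stocks[src] or stocks[dst] + n > capacities[dst]:
                break
            moved = list(stocks)
            moved[src] -= n
            moved[dst] += n
            cost = total(moved)
            if cost < best[0]:
                best = (cost, (src, dst, n))
    return best
```

With stocks `[8, 1, 4]`, capacities `[10, 10, 10]`, predicted pickups `[1, 5, 2]`, predicted returns `[3, 0, 1]`, and forecast errors `[2.0, -3.5, 0.1]`, the sketch selects stations 1 and 0 as candidates and returns `(1, (0, 1, 3))`: moving three bicycles from station 0 to station 1 lowers the predicted unmet demand from 4 to 1. A full MDP treatment would instead evaluate actions against a value function over future intervals rather than a single step.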
Similar resources
Approximate Dynamic Programming and Reinforcement Learning
Dynamic programming (DP) and reinforcement learning (RL) can be used to address problems from a variety of fields, including automatic control, artificial intelligence, operations research, and economy. Many problems in these fields are described by continuous variables, whereas DP and RL can find exact solutions only in the discrete case. Therefore, approximation is essential in practical DP a...
Reinforcement Learning And Approximate Dynamic Programming For Feedback Control
Editorial: Special Section on Reinforcement Learning and Approximate Dynamic Programming
Approximate dynamic programming (ADP) is to compute near-optimal solutions to Markov decision problems (MDPs) with large or continuous spaces. In recent years, the research works on ADP have been brought together with the reinforcement learning (RL) community [1-4]. RL is a machine learning framework for solving sequential decision making problems that can also be modeled as the MDP formalism. ...
A Public Bicycle Sharing System Considering Renting and Middle Stations
Recently, the public bicycle sharing system (PBSS) has become one of the most popular urban transportation systems, helping governments reduce environmental problems such as pollution and traffic. This paper studies a sharing system that includes two types of stations. The first category contains stations where users can rent or return bicycles, and each bicycle can be rented by any n...
Optimal Learning and Approximate Dynamic Programming
Approximate dynamic programming (ADP) has emerged as a powerful tool for tackling a diverse collection of stochastic optimization problems. Reflecting the wide diversity of problems, ADP (including research under names such as reinforcement learning, adaptive dynamic programming and neuro-dynamic programming) has become an umbrella for a wide range of algorithmic strategies. Most of these invol...
Journal
Journal title: Journal of Advanced Transportation
Year: 2022
ISSN: 0197-6729, 2042-3195
DOI: https://doi.org/10.1155/2022/2780711